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AMENDMENTS TO CLAIMS : 

The listing of claims will replace all prior versions, and listings, of claims 

in the application: 

T JSTING OF CLAIMS : 

1. (Currently Amended) A document categorizing method for categorizing a 
plurality of documents into a plurality of clusters according to semantic 
similarity, said method being characterized in that: 

after categorizing said plurality of documents into a plurality of clusters 
according to semantic similarity, a cluster merging process is performed such 
that relations among clusters of said plurality of clusters are evaluated on the 
basis of documents included in the respective clusters, and two or more clusters 
having a degree of relation equal to or higher than a predetermined value are 

combined together: 

wherein said cluster merging process defin es said degree of relation 
between multiple clusters under considera t ion as the number of distinct files 
common to all of said clusters under consid e ration multiplied by a predefined 
multiplication factor divided bv a to tal sum of all the files in said clusters under 
consideration . 

2. (Currently Amended) A document categorizing method according to Claim 
1, wherein said multiplication factor is equal to the number o f clusters under 
consideration, paid cluotcr merging proeooo io performed ouch that the evaluation 
of relations among cluGtcro undor conoidcration an to whether they should bo 
merged or not io performed on the basin of the number of documcnto commonly 
included in paid cluotoro under conoidcration relative to the total number of 
documents included in oaid cluctcro under conoidcration, and cluoter merging is 
performed in accordance with the evaluation rcault. 

3. (Previously Presented) A document categorizing method according to Claim 
1, wherein said cluster merging process is performed such that the manner in 
which feature elements, which characterize respective clusters under 
consideration as to whether they should be merged or not, appear in said 
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respective clusters under consideration is examined, and cluster merging is 
performed in accordance with the manner in which the feature elements appear. 

4. (Previously Presented) A document categorizing method according to Claim 
1, wherein said cluster merging process is performed on at least two clusters, 
and after completion of said cluster merging process a first time, said cluster 
merging process is repeatedly performed on the resultant set of clusters until no 
further cluster merging occurs. 

5. (Previously Presented) A document categorizing method according to Claim 
1, wherein after completion of said cluster merging process, supplementary 
information indicating that cluster merging has been performed and also 
indicating the basis on which the cluster merging has been performed is output. 

6. (Cancelled) 

7. (Currently Amended) A document categorizing method according to Claim 6, 
for categorizing a plurality of docum e nts into a plurality of clusters according to 
semantic similarity, said method being characterized in that: 

after categorizing said plurality of documents into a pl urality of clusters 
according to semantic similarity, a c l uster merging process is performed such 
that relations among clusters of said plurality of c luster s are evaluated on the 
basis of documents included in the res pective clusters, and two or more clusters 
having a decree of relation equal to o r higher than a first predetermined value 
are combined together; 

wherein said generated information, which io output do n o to enable the 
uocr to dog in what manner oaid cluster merging procooo hao boon performed, is 
prooontod by modifying the manner of dicplaying the cluster names of respective 
clusters merged together in accordance with the degree of relation among o aid 
cluotoro merged together, oaid cluster namco being are_displayed in-such a 
m an n er t hat when said degree of relation among said clusters is higher than a 
second p redetermined hi pW t.han said first predetermined value , said 

cluster names are displayed in an-a_AND-first listing format, and when said 
degree of relation among said clusters is lower than said second p redetermined 
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^nrl hi gher than said first predetermined v a l ue, said cluster names are 
displayed in ^-OR -« second listing f ormat, 

8. (Currently Amended) A document categorizing method according to Claim 7, 
wherein when said cluster names are displayed in said ANP -first listing f ormat, 
said cluster names of the respective clusters are displayed successively in a 
single horizontal line or are displayed successively in different lines, and when 
said cluster names are displayed in said OR-second listing format, a delimiter is 
inserted between adjacent cluster names of the respective clusters. 

9. (Previously Presented) A document categorizing method according to Claim 
7, wherein when a first cluster includes a second cluster therein, the name of 
said second cluster included in said first cluster is enclosed within brackets and 
placed after the name of said first cluster. 

10. (Currently Amended) A document categorizing apparatus for categorizing a 
plurality of documents into a plurality of clusters according to semantic 
similarity, said apparatus comprising: 

a clustering unit for categorizing a plurality of documents into a plurality 
of clusters in accordance with semantic similarity; and 

a cluster merging unit wMeh-for_evaluatinges the relation among the 
plurality of clusters created by said clustering unit on the basis of the documents 
included in the respective clusters and then combines two or more clusters 
having a degree of relation equal to or higher than a predetermined value: 

wherein said cluster merging unit define s s aid degr ee of relation between 
multiple clusters under consideration as the number of distin ct, files common to 
all of said dusters under considera t ion multiplied by a predefined multiplication 
factor divided bv a total sum of all t h e files in said clusters under consideration . 

11. (Currently Amended) A document categorizing apparatus for categorizing a 
plurality of documents into a plurality of clusters according to semantic 
similarity, said apparatus comprising: 

a clustering unit for categorizing a plurality of documents into a plurality 
of clusters in accordance with semantic similarity, 
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a cluster merging unit wMefe-fpjijivaluatingee the relation among the 
plurality of clusters created by said clustering unit on the basis of the documents 
included in the respective clusters and then combines two or more clusters 
having a degree of relation equal to or higher than a first predetermined value; 

a clu ste r m e rgin g p ror^ ^fminotinn gnnnrrator for generating cluotor 
merging proccop information rcprooonting which cluotoro have been merged 
together a n d al c o rep-""™* 1 '™ fhf > rln.fn-n . nr , of relation among the merged cluotors - 
wherein paid cluotor merging procooo information io to be displayed when final 
clusters obtained via oaid cluotor merging procooc performed by oaid cluster 
merging unit arc displayed go that a user can coo in what manner paid cluster 
merging proccao hap been performed to obtain oaid final cluotor; and 

categorization result outputting means for outputting oa id cluotor 
merging procooo information ouch that paid cluotcr merging procooo information 
io included in the categorization rcoult to be presented to oaid uoer. 

an out put disnlav for displaying the clus t er nam es of respective clusters 
merged together such that when sa id degree of relation among said clusters is 
higher than a second predetermined v alue higher than said first predetermined 
value, said cluster names are displ a yed in a first listing format, and when said 
degree of relation among said clusters is lower than said second predetermined 
value and higher than said first p r edetermined value, said cluster names are 



displayed in a second listing format. 

12. (Currently Amended) A storage medium on which a document categorizing 
program for categorizing a plurality of documents into a plurality of clusters 
according to semantic similarity is stored, said document categorizing program 
comprising: 

a clustering step for categorizing a plurality of documents into a plurality 
of clusters in accordance with semantic similarity, and 

a cluster merging step in which the degrees of relation among clusters of 
said plurality of clusters obtained in said clustering step are evaluated on the 
basis of documents included in the respective clusters, and two or more clusters 
having a degree of relation equal to or higher than a predetermined value are 
combined together; 

Customer No. 20178 7 
Response A.doc 



P5276b 



09/762,126 



Response A 



wherein said cluster merging ste p defines said degree of relation between 
multiple clusters under consideration as the number of distin ct files common to 
all of said dusters under consideration multi p lied bv a predefined multiplication 
factor divided hv a total sum of all the files in said clusters u nder consideration. 

13. (Currently Amended) A storage medium on which a document categorizing 
program for categorizing a plurality of documents into a plurality of clusters 
according to semantic similarity is stored, said document categorizing program 
comprising: 

a clustering step for categorizing a plurality of documents into a plurality 
of clusters in accordance with semantic similarity; 

a cluster merging step in which the degrees of relation among clusters of 
said plurality of clusters obtained in said clustering step are evaluated on the 
basis of documents included in the respective clusters, and two or more clusters 
having a degree of relation equal to or higher than a first predetermined value 
are combined together;_and 

a oluotor merging procooo information generating otcp for generating 
cluotor merging proceco information representing which cluctcro have been 
merged together and aloo rcproocnting the dogrcoc of relation among the merged 
cluotcro wherein ooid cluctcr merging .procooo information ia to be dicplayod 
when final cluotcro obtained via paid cluotor merging procooo performed by said 
cluotor merging otop arc dicplayod oo that a user can oco in what manner sai d 
cluotor merging proccoo hoc boon performed to obtain oaid final cluotor; and 

a step for outputting said cluster-merging-process information ouch that 
paid cluotor merging procooo information io included in the categorization re s ult 
t n h r rr ~ rn 1 ^ + n nmM «ror , wherein the cluster names of respective clusters 
merged together are displayed such that when sa id d e gree o f relation among said 
clusters is higher than a second predetermin ed valu e hig her than said first 
predetermined value, said cluster names ar e dis played in a first listing format, 
and when said degree of relation among sai d clusters is lower than said second 
predetermined value and higher tha n said first predetermined value, said cluster 
names are displayed in a seco nd listing format. 
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14. (New) The document categorizing method of Claim 1, wherein said 
multiplication factor and said number of clusters under consideration is two. 

15. (New) The document categorizing method of Claim 7, wherein said first 
listing format is an AND listing format and said second listing format is an OR 
listing format. 

16. (New) The document categorizing apparatus of claim 10, wherein said 
multiplication factor is equal to the number of clusters under consideration. 

17. (New) The document categorizing apparatus of Claim 10, wherein said 
multiplication factor and said number of clusters under consideration is two. 

18. (New) The storage medium of Claim 12, wherein said multiplication factor 
is equal to the number of clusters under consideration. 

19. (New) The storage medium of Claim 12, wherein said multiplication factor 
and said number of clusters under consideration is two. 

20. (New) The document categorizing apparatus of claim 11, wherein said first 
listing format is an AND listing format and said second listing format is an OR 
listing format. 

21. (New) The document categorizing apparatus of claim 11, wherein when said 
cluster names are displayed in said first listing format, said cluster names of the 
respective clusters are displayed successively in a single horizontal line or are 
displayed successively in different lines, and when said cluster names are 
displayed in said second listing format, a delimiter is inserted between adjacent 
cluster names of the respective clusters. 

22. (New) The storage medium of claim 13, wherein said first listing format is an 
AND listing format and said second listing format is an OR listing format. 

23. (New) The storage medium of claim 13, wherein when said cluster names are 
displayed in said first listing format, said cluster names of the respective clusters 
are displayed successively in a single horizontal line or are displayed 



Customer No. 20178 
Response A.doc 



9 



P5276b 



09/762,126 Response A 



successively in different lines, and when said cluster names are displayed in said 
second listing format, a delimiter is inserted between adjacent cluster names of 
the respective clusters. 
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